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Abstract —We consider locally repairable codes over small 
fields and propose constructions of optimal cyclic and linear codes 
in terms of the dimension for a given distance and length. 

Four new constructions of optimal linear codes over small 
fields with locality properties are developed. The first two 
approaches give binary cyclic codes with locality two. While the 
first construction has availability one, the second binary code is 
characterized by multiple available repair sets based on a binary 
Simplex code. 

The third approach extends the first one to g-ary cyclic codes 
including (binary) extension fields, where the locality property 
is determined by the properties of a shortened first-order Reed- 
Muller code. Non-cyclic optimal binary linear codes with locality 
greater than two are obtained by the fourth construction. 

Index Terms —Availability, distributed storage, locally re¬ 
pairable codes, Reed-Muller code, Simplex code, sphere-packing 
bound 

I. Introduction 

Locally repairable codes (LRC) can recover from erasure(s) 
by accessing a small number of erasure-free code symbols and 
therefore increase the efficiency of the repair-process in large- 
scale distributed storage systems. Basic properties and bounds 
of LRCs were identified by Gopalan et al. [1], Oggier and 
Datta [2] and Papailiopoulos and Dimakis [3]. The majority 
of the constructions of LRC requires a large field size (see 
e.g. [4]—[6]). The work of Kuijper and Napp [7] considers 
binary LRCs (and over binary extension field). Cadambe and 
Mazumdar [8] gave an upper bound on the dimension of 
a (nonlinear) code with locality which takes the field size 
into account. Goparaju and Calderbank [9] proposed binary 
cyclic LRCs with optimal dimension (among linear codes) for 
distances 6 and 10 and locality 2. 

Our paper is based on the work of Goparaju and Calder¬ 
bank [9] and we use their projection to an additive code 
without locality (see Calderbank et al. [10], Gaborit et al. [11], 
Kim et al. [12] for additive codes). We construct a new family 
of optimal binary codes (with distance 10 and locality 2) 
and generalize the approach to g-ary alphabets. Furthermore, 
we give a construction of optimal binary cyclic codes with 
availability greater than one based on Simplex codes (see 
Pamies-Juarez et al. [13], Rawat et al. [14] for the definition 
of availability and Kuijper and Napp [7] for a Simplex code 
based construction). 

This work has been supported by German Research Council (Deutsche 
Forschungsgemeinschaft, DFG) under grant ZE1016/1-1. 


This paper is structured as follows. Section II gives 
necessary preliminaries on linear and cyclic codes, defines 
LRCs, recalls the generalized Singleton bound, the Cadambe- 
Mazumdar bound [8] as well as the definition of availability 
for LRCs. The concept of a locality code and the projection 
to an additive code are discussed in Section III based on the 
work of Goparaju and Calderbank [9], Two new constructions 
of optimal binary codes are given in Section IV and a con¬ 
struction based on a g-ary shortened cyclic first-order Reed- 
Muller code is given in Section V. The fourth construction 
in Section VI uses code concatenation and provides optimal 
linear binary codes. Section VII concludes this contribution. 

II. Preliminaries 

Let [a, b ) denote the set of integers {a, a + 1,..., b — 1} 
and [ b ) be the shorthand notation for [0, £>). Let F q denote the 
finite field of order q and F g [A] the polynomial ring over ¥ q 
with indeterminate X. A linear [n, k,d\ q code of length n, 
dimension k and minimum Hamming distance d over F f/ is 
denoted by a calligraphic letter like C as well as a non-linear 
(n, M, d) q of length n, cardinality M and minimum distance 
d. 

An [n, k, d] q g-ary cyclic code C with distance d is an ideal 
in the ring F 9 [A']/(A' n —1) generated by g{X). The generator 
polynomial g(X) has roots in the splitting field F g », where 

n | (q s - 1). 

A g-cyclotomic coset M vn is defined as 

M i>n d = {iq J mod n\j G [a)}, (1) 

where a is the smallest positive integer such that iq a = i mod 
n. The minimal polynomial in F g [A] of the element a 1 G F ?m 
is given by to.;(A) = n (A — cF). The defining set Dq 

of an [n,k,d\ q cyclic code C is 

D c = {0 < i < n- 1 \g{of) = 0}. (2) 

For visibility we sometimes mark a position i with a □ if 
g(a l ) ^ 0. Furthermore, let Df'' be the short-hand notation 
for {(z + z) | i G Dc} for a given ; G 1 Let us recall the 
definition of linear locally repairable codes. 

Definition 1 (Locally Repairable Code (LRC)). A linear 
[n, fc, d] q code C is said to have ( r,6)-loca!ity if for all n 
code symbols c*, Vz G \n), there exists a punctured subcode of 


C with support containing i, whose length is at most r + <$ — 1, 
and whose minimum distance is at least 5. 


A code C is called r-local if it has (r, 2)-locality. 

The following generalization of the Singleton bound for 
LRCs was among others proven in [15, Thm. 3.1], [6, Con¬ 
struction 8 and Thm. 5.4] and [16, Thm. 2], 


Theorem 2 (Generalized Singleton Bound). The minimum 
distance d of an [n, k , d\ q linear (r, 5)-locally repairable code 
C (as in Def 1) is upper bounded by 


d < n — k + 1 — 



l) (s-i). 


(3) 


For i5 = 2 and r = k it coincides with the classical Singleton 
bound. Throughout this contribution we call a code Singleton- 
optimal if its distance meets the bound in Thm. 2 with equality. 
The generalized Singleton bound as in Thm. 2 does not take 
the field size into account. We compare our constructions with 
the bound given by Cadambe and Mazumdar [8, Thm. 1] 
which depends on the alphabet size. In general it holds also 
for nonlinear codes, but we state it only for linear codes in 
the following. 


Theorem 3 (Cadambe-Mazumdar (CM Bound)). The dimen¬ 
sion k of an r-local repairable code C of length n and 
minimum Hamming distance d is upper bounded by 

k < min jfr + k ( 0 9 p ] (n - t(r + 1), d) j , (4) 

where kf^ t (n,d) is the largest possible dimension of a code 
of length n, for a given alphabet size q and a given minimum 
distance d. 


In the following we use Thm. 3 to bound the dimension 
of linear codes. Another important parameter for LRCs is 
the availability e.g. considered in Kuijper and Napp [7] and 
Cadambe and Mazumdar [8] and therefore we define it in the 
following. 

Definition 4 (Availability). An [n,k,d\ q linear code C is 
called t-available-r-local locally repairable if every code 
symbol Cj,V* € [n), has at least t parity-checks of weight 
r + 1 which intersect pairwise in (and only in) {*}. 

III. Locality Code and Additive Code 

In this section we shortly recall the approach of Goparaju 
and Calderbank [9] and extend it to what we call a locality 
code. 

Let us first describe the idea of Constructions 1 and 2 of 
[9] in terms of a locality code. Construction 1 of [9] gives a 
binary cyclic code C of length n = 2"' 1 with locality r, 

where the code length n is divisible by r +1. The defining set 
is Dq = {i mod (r + l),Vi € [n)}. This equals the union of 
n/(r + 1) shifted defining sets Dc = {0} of the binary cyclic 
[r + 1, r, 2] 2 single-parity check code, which is able to correct 
one erasure within a block of length r +1 by “accessing” only 
r other code symbols. The code C inherits the properties of £, 
also the minimum distance of two, which is Singleton-optimal, 


but does not increase the overall erasure-correction capability. 
In general, let C be the aimed \n. k. d) q code with locality 
properties that are inherited from an [ni, hi, di\ q locality code 
£. Namely the code constructions of [9] and our (cyclic) codes 
are subcodes of the [n, k, d\ q cyclic product code £®T, where 
T is the trivial [n/n;,n/rq, l\ q code (see [17]), i.e., a cyclic 
code with defining set: 

D c = U D l p ] U • • • U D [ ™- ni ~ 1] U . (5) 

In Construction 2 (R = M l ra ) and 3 (R = M l n U M_ 1>n ) of 
[9], the locality code £ is a [3, 2, 2]2 single-parity check code 
with defining set Dc = {0}. The optimality among binary 
codes with locality r = 2 is shown via the projection to an 
additive code. 


Lemma 5 (Projection to Additive Code). Let £ be an 
[ni, ki, di] q locality code and let C be an [n,k,d] q code with 
defining set as in (5). Then, we can project each sub-block of 
ni symbols of a codeword in C to one symbol in . The 
obtained (n 1 , q k , d')^ additive code A has parameters: 

n' = n/ni and d' > \d/ui\, (6) 


where u> is the maximum weight of a codeword in £. 


Proof: The length n' and the alphabet-size follow directly 
from the projection of the coordinates. The cardinality of A 
equals the one of C. The distance follows from the fact that in 
the worst-case uj non-zero symbols of C are projected to one 
symbol over F q k t (see Fig. 1). ■ 



Fig. 1. Illustration of a nonzero minimum-weight codeword of weight seven 
of the [n, k, d] q code C arranged in n/ni blocks of length nj. The ★ marks a 
nonzero symbol in ¥ q . The corresponding codeword of the additive code A 
over ¥ k t has length n/ni and has at least weight \d/ui |. Here uj = 4 and 
therefore at least two symbols in A are nonzero (first and before last column). 
The redundancy added by the [ni,ki,di\ q locality code £ is illustrated as gray 
symbols, while the black symbol marks the additional redundancy to obtain 
a distance that is higher than the one given by the Singleton bound. 


For a cyclic binary [r + 1, r, 2)2 single-parity check code of 
odd length, the maximum weight of a codeword is u> = r. 































Lemma 6 (Locality Code). If £ is an [n; = r + S — 1, r, <5] g 
MZ)S locality code, then the cyclic code C with defining set as 
in (5) has (r, S)-locality and its distance is d > 5. 

Proof: The (r, 5)-locality of C follows directly from the 
construction. The distance of C C £®T is at least the distance 
of the product code C® T. ■ 

Note that for R = 0, the code C = £ ® T is Singleton- 
optimal, i.e., the dimension of C is k = nr/(r + <5 — 1) and 
from (3) we obtain: 


d < n — 


nr 


r + 5 - 1 


1 - 


r + 8 - 1 


-1 05 - 1 ) 


n(r + 6 — 1) — nr — n(5 — 1) 

< —----- + S = 6. 
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IV. Binary Cyclic Codes With Locality Two 


Construction 1 in [9] gives Singleton-optimal binary cyclic 
codes (these codes are of lowest-rate for d = 2 see [18, Prop. 
2]). The following construction gives a new class of binary 
cyclic 2-local codes. 

Construction 7 (Binary Reversible Codes). Let n = 2 m + 1 
and 3|n and therefore m odd. Let the locality r = 2, i.e. let 
£ be a [3, 2, 2] 2 single-parity check code with Dc = {0}. Let 
the defining set be 


D c = {{...,-6,-3,0,3,6,9,12,... }U AT,,„}, 

= {...,-6, -4,-3,-2,-1,0,1,2,3,4,6,...}. 


Then C has dimension k = |(2 m + 1) — 2m and distance 

d > 10. 

Due to the length, the coset Mi >n is reversible (see [19] 
for reversible codes), i.e., M-\ n = {1, 2,..., 2 m_1 , 2 m = 
— 1, — 2,..., — 2 m ~ 1 } and has cardinality 2m. The distance 
follows from the BCH bound [20] , [21], where the consecutive 
sequence is —4, —3,..., 3,4. 

Theorem 8. Let a binary linear code with parameters as in 
Construction 7 be given. Then its dimension satisfies: 

k < \{2 m + 1) - 2m. (7) 

O 


Proof: Equivalent to the proof of [9, Thm. 2], we 
have via sphere-packing bound (see [22, Ch. 1 §5]) for the 
(2n/3, 2 fc , 5) 2 2 additive code that 


2 k < 


4 n ' 

1 + 3n' + 


and therefore 

k < log 2 (4 n/3 ) - log 2 ^1 - y + y 2 j 
= y + 1 - riog 2 (2 ~n + n 2 )]. 


( 8 ) 


( 9 ) 


With n = 2 m + 1, we obtain from (9) 

On 

k=- + l- [log 2 (2 - (2 m + 1) + (2 m + 1) 2 )1 

On 

= y + l-riog 2 (2 + 2 m + 2 2m )] 

2 n , 2n 

= — + 1 — (2m + 1) = —-2 to. 

o o 

■ 

Remark 1: A binary cyclic code as in Construction 7 
without Mi >n in the defining set is Singleton-optimal and the 
distance equals d = 2 (for k = 2n/3, r = 2 and 5 = 2), which 
is the smallest minimum distance possible for a binary cyclic 
code with rate 2/3 (see [18, Prop. 2]). 

Remark 2: The nth root of unity is in F 2 2 m_! (which is twice 
the extension order of the code obtained via [9, Construction 
3]) and therefore the (non-local) decoding complexity is higher 
than [9, Construction 3]. 

Example 9 (Optimal 2-Local Binary Code). Let n = 2 5 +1 = 
33 and via Construction 7 we obtain a binary cyclic code of 
minimum distance d = 10, with 

D c = {{0,3,6,9,..., 30} U {1,2,4,..., 32}} 

and dimension k = 12, which is 2-local. The CM bound (see 
Thm. 3) based on the best-known linear codes give k < 13. 

Let us consider Construction 4 of [9] based on the [7, 3,4] 2 
locality code £ with locality r = 2, availability t = 3 and 
with debiting set Dc = {0, □, □, 3, □, 5, 6}. The code C with 
debiting set as in (5), but with R = 0 is an [n = 2 m — 1, k = 
3n/7,4] 2 cyclic code, where to is a multiple of three. 

We extend Construction 4 of [9] to obtain a higher distance 
and small reduction of the rate as follows. 

Construction 10 (Sphere-Packing Optimal Binary Code with 
Locality Two and Increased Availability). Let n = 2 m — 1 and 
be divisible by 7 and therefore 3|to. Let the defining set be: 

D c = -9, -8, | - 7, □, □, -4, □, -2, -1, |0, □, □, 

3, □, 5,6,17, □, □, 10, □, □, 12,13|,... } 

U M ljn = {1, 2,4,..., 2” 1-1 }}. 

Then d > 12 (via BCH bound [20], [21], where the con¬ 
secutive sequence is —2,1,..., 8). The constructed [n = 
2 m — 1, k = 3n/7 — to, d 0 12] 2 cyclic code C is a 2-local 
code and has availability t = 3 as defined in Def. 4. 

Example 11 (2-Local Binary Code with Availability Three). 
Let n = 9 • 7 = 63, Dc = {0, □, □, 3, □, 5,6} and let the 
defining set be 

D c = {D c u4 7I U-U D l Z 6] U M 1i63 } 

= 59,61,62, |0,3,5,6, |7,10,..} U {1,2,4,.., 32}}. 

The constructed code C is an [63,21,12] 2 code and the 
corresponding additive code according to Lemma 5 is a 
(9 , 2 7 , 3) 2 3 code. The BCH bound is tight and the consecutive 
sequence ranges is 61, 62, 0,1,..., 8. 







Let us prove the optimality of Construction 10 in the 
following theorem. 


Theorem 12. Let 3| m and let C be an [2 m — 1, k , 12] 2 linear 
code, let C be the [7, 3, 4 ]2 Simplex code and let C have the 
locality inherited from C as in Lemma 5. Then, 

k < y (2 m - 1) - to. (10) 

Proof: From (6) we have an (n' = n/7,2 k ,d' = 
[12/4] = 3) 2 3 additive code A. The [7,3,4] 2 Simplex code 
is a constant-weight code and therefore u> = 4. The code A is 
defined over F 2 3 and has dimension 


k! = k/3 


1 

3 



= n — m . 


and therefore the parameters of a [((2 3 ) m ' — 1)/(2 3 — 1), n' — 
to', 3] 2 3 Hamming code, which is optimal w.r.t. the sphere¬ 
packing bound. ■ 

Construction J_0 can be extended to the case where the 
locality code C is the [2 a — l,a, 2 a_1 ] 2 cyclic Simplex code, 
which is 2-local and has availability t = 2 a ~ 1 — 1 (see e.g. 
Kuijper-Napp [7, Lemma 3.1] and Wang-Zhang [23]). 


V. Q-Ary Case: First-Order Shortened RM Code as 
Locality Code 

We extend the previous approach for cyclic codes to the 
q- ary case and use as locality code C the q- ary 

[q 2 - 1, 2, (q - l)q 2 ~\ = [q 2 - 1, 2, q 2 - q] g (13) 

cyclic shortened first-order Reed-Muller (RM, see [24, Prob¬ 
lem 2.17] and [25, Section 6.11]) code. Its dual code is 
the [q 2 — 1 , g 2 — 3, 2] q code with defining set {l,g}. A 
[q a — 1 ,a,q a ~ 1 (q — l)] g shortened first-order RM code is 
the q- ary pendant of the Simplex code and also a constant- 
weight code with u> = q a ~ 1 (q—l). RM codes have the highest 
minimum distance possible for the given parameters among q- 
ary linear codes. Furthermore, first-order RM codes and their 
locality properties were investigated by Rawat and Vishwanath 
in [26]. 

Construction 15 (Reed-Muller Code Locality). Assume q > 
2. Let C be an [q m — 1, At, d] q code. Let C be an [q 2 — 1, 2, q 2 — 
q] q cyclic RM code with defining set 

Dc = {0, □, 2, 3,..., q - 1, □, q + 1,..., q 2 - 2] . 


Construction 13 (Binary Code with Simplex Locality). Let 
n = 2 m — 1 and be divisible by 2“ — 1 and therefore a\m. Let 
C be the [2 a — 1, a, 2 a-1 ] 2 cyclic Simplex code with defining 
set 

D c = {0, □, □, 3, □, 5, 6,.., □, 2 a_1 + 1,2“ — 1} . (11) 

Let the defining set of the code C be: 

D c = { D c U D [ £~ 1] U £)P (2 “ -1)i u ... u Mi,„} 

= {..., —2 a_1 + 1,..., -1|, 0,1,..., 2° - 1, |2 a ,... }. 

Then d > 2° + 2 a ^ 1 (via BCH bound for the consecutive 
sequence from — (2 a ~ 1 — 1) to 2 a ) and the dimension is k = 
9(2 m -l)-m. 

We have the following theorem on the optimality of the 
dimension of linear codes. 


Theorem 14 (Simplex Locality). Let a\m and let C be an 
[2 m — 1, k, 2 a—1 • 3] 2 linear code, let C be the [2 a — l,a, 2 a-1 ] 2 
binary Simplex code and let C have the locality properties 
according to C as in Lemma 5. Then, 

k ~¥~l d2) 


Proof: From (6) we have an (n/( 2“ — 

1), 2 k , [d/(2 a-1 )] ) 2 a additive code A, where ui = 2 a — 1, 
because the simplex code is a constant-weight code. The 
additive code has the parameters of a Hamming code over 
F 2 a with dimension 


k! = k/a = 


1 

a 



= n' — to', 


and distance 


' d ' 


"2 a_1 (l + 2)" 

2 a ~ 1 


2° —1 


Let the defining set of C be: 

D C = { D C U D [ l~ 1] • ■ • U {1, q, q 2 ,..., g™' 1 }} 
= {D, (g 2 — q — 2),0,.., q 2 + q — 2, □,..}. 


Then the dimension of C is k = c 2 ™ 1 — m and the distance 
is d > q 2 — q — 2 + q 2 + q — 2 + 1 + 1 = 2g 2 — 2 via BCH 
bound [20], [21 ] (from —(q 2 — q — 2).. + (q 2 + q — 2)). 


Theorem 16 (Locality Code: Shortened First-Order RM 
Code). Let C be an [q m — l,fc, (q 2 — q) ■ 3] 2 linear code, 
let C be the [g 2 — 1, 2, q 2 — q] q RM code and let C have the 
locality properties according to C as in Lemma 5. Then, 

q z — 1 

Proof: The additive code A has parameters 
, q m — 1 

n = ~2 — T’ 
q z — 1 

k' = k/2, 

q -q 

and has alphabet-size F„ 2 . More explicitly, the dimension is: 


k’ = k/2 = - 

and the distance is 


1 (2(q m - 1) 


- 1 


— to = n — to 


d 

> 

"2g 2 — 2" 


"g 2 (2 — 2/g 2 )" 

q 2 -q 
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7 2 (1 - l) 


Therefore the additive code has the parameters of an g 2 -ary 
Hamming code, which is optimal w.r.t. to the sphere-packing 
bound. ■ 
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Example 17 (Optimal Ternary Code). Let q = 3 and let n = 
3 4 — 1 = 80. Let C be the [ 8 , 2, 6 ] 3 shortened first-order cyclic 
RM code with defining set is Dc = {0, □, 2, □, 4, 5, 6 , 7}. 
Then, the defining set of C according to Construc¬ 
tion 15 is Dc = —4,0, □, 2, □, 4, 8 , □, 10, □,..} U 

{1,3,9,27}} and therefore the BCH bound gives d > 16 
(—4. .10). And thus d' = [16/6] = 3. 

Construction J_5_ is also valid for extension fields and there¬ 
fore let us give another example over a binary extension field. 

Example 18 (Cyclic Optimal Code over Binary Ex¬ 
tension Field). Let q = 2 2 and let n = 4 4 — 

1 = 255. Let C be the [15,2,12]4 RM code with 
defining set Dc = {0, □, 2, 3, □, 5,..., 14}. Then, the 
defining set of C according to Construction 15 is 
D c = -10,.., 0, □, 2,3, □, 5,15, □, 17,18, □“} U 

{1,4,16, 64}}. The BCH bound gives d > 30 (from —10..18). 
The additive code over F 2 2 has length v! = 255/15 = 17, 
dimension k' = 17 — 4 = 13 and distance d' = [30/12] = 3. 

The real distance of the codes via Construction 15 is 3(q 2 — 
q), but the BCH bound is not tight. Other bounds for cyclic 
codes can deliver a better result and a more advanced algebraic 
decoders for the non-local erasure-decoding can be applied. 

VI. Optimal Linear Codes 

Based on concatenated codes [27], [28], we propose a 
construction, where the row-code is a linear Hamming code 
over the binary extension field F 2 r. 

Construction 19 (Binary Linear r-Local Code). Let the row 
code be a [(2 2r -l)/(2 r -l) = 2 r + l, 2 r + l-2 = 2 r -l,3] 2 r- 
binary Hamming code and let the column-code C be a linear 
(not necessarily cyclic) [r + 1 , r, 2] 2 single-parity check code. 
Then the concatenated coded code is a \(2 r + l)(r+ 1), (2 r — 
l)r, 6] 2 r-local code. 

Construction 19 gives an r-local linear code, with highest 
possible dimension k. (The additive codes is a binary Ham¬ 
ming code.) 

Example 20 (Binary Code with Locality r = 3). Let the 
[((2 3 ) 2 — 1)/ (2 3 — 1) = 9,9 — 2 = 7, 3] 2 3 Hamming code be 
the row code of a concatenated code and let the column code 
be the [4, 3,2] 2 single-parity check code that corresponds to 
locality r = 3. Then C is a [36, 21,6] 2 3-local linear code. 
The CM bound (Thm. 3) gives k < 21. 

VII. Conclusion and Outlook 

We proposed new constructions of optimal binary and <y-ary 
cyclic (and linear) codes with locality and availability. 

The following future work seems fruitful. The extension of 
Construction 13 to codes with higher distance and optimal 
dimension, the usage of other first-order cyclic Reed-Muller 
codes as locality code similar to Construction 15, the usage 
of improved bounds on the minimum distance for the cyclic 
codes obtained via Construction _L5 and the extension of 
Construction 19 to q-ary linear codes. 
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